CDS
Accession Number | TCMCG075C26296 |
gbkey | CDS |
Protein Id | XP_007013379.2 |
Location | join(6621560..6621731,6622122..6622204,6622628..6622722,6623718..6623821,6624229..6624398,6624526..6624640,6625080..6625135,6625224..6625262,6625433..6625531,6625806..6625868,6625987..6626049,6626163..6626245,6626418..6626526,6626626..6626679,6627248..6627335,6627514..6627551,6630189..6630294,6631068..6631234,6631392..6631439,6631564..6631614) |
Gene | LOC18588725 |
GeneID | 18588725 |
Organism | Theobroma cacao |
Protein
Length | 600aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007013317.2 |
Definition | PREDICTED: imidazole glycerol phosphate synthase hisHF, chloroplastic isoform X3 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGAGGGGGTGCCATATGCTTACACTACAAGCTTCAAAACACAATTATTTTTGTCGTCTGCACTGTCATCATCGTCTATTATAACCATCCACCAAAAGCGTCACAAAACTATTTTAAAATCCATATCTCGTAGAAATCTTGTTATCTGTGCTTCATCTGGTTCTAGTTCTGTTGTGAAGTTGCTTGATTATGGAGCTGGAAATGTTCGGAGCTTAAGGAATGCTATTCACCATCTTGGCTTTGAGATAGAGGATGTGCAAACTCCAAAAGACATTTTGGATGCTGAACGCCTTATCTTTCCTGGTGTTGGGGCATTTGCTTCAGCCATGGATGTATTGGTCAAGACCGGGATGGCTGACGCACTTTGTTCCTATATCAAGAATGATCGCCCATTTCTAGGCATTTGTCTTGGCCTTCAACTACTTTTTGAGTCTAGTGAAGAGAATGGACCAGTGAATGGTCTAGGCTTGATACCTGGTGTGGTTGGGCGGTTTAACTCTTCAAATGGTTTTAGAGTACCCCATATTGGCTGGAATGCTTTGCAAATTACAAAAGACTCTGAAATTTTGGATGACATTGGAGATCACCATGTCTACTTTGTTCACTCTTACCGTGCCATGCCATCAGATGATAACAAGGAATGGATTTCATCTACATGCAATTATGGTGATGATTTTATAGCGTCTATCAGAAGGGGAAATGTGCATGCAGTTCAGTTCCATCCAGAGAAGAGTGGAGATGTTGGTCTTTCTGTATTGAGAAGGTTTCTAGATCCAAAGTCACAGGGGACAAAGAATCTTACTCAGGGGAAGGCTTCAAAACTTGCTAAGAGGGTGATTGCTTGTCTTGATGTTAGGACGAATGATAAGGGGGATCTTGTTGTCACCAAAGGGGACCAGTATGATGTACGAGAGCACACAAAAGAGAATGAGGTGAGAAACCTTGGCAAACCTGTGGAGCTTGCTGGACAGTATTACAAAGATGGGGCTGATGAGGTCAGTTTTTTGAACATTACTGGCTTCCGTGACTTCCCATTAGGCGATTTACCAATGTTGCAGGTATTAAGACGCACTTCAGAGAATGTTTTTGTCCCACTAACGGTCGGAGGTGGTATACGAGATTTTACAGATGCAAATGGCAGGCACTATTCTAGTTTGGAGGTTGCTTCAGAGTACTTTAGGTCTGGGGCTGATAAAATTTCCATTGGGAGTGATGCAGTTCATGCAGCAGAAGAATATATGAAAACCAAAGTAAAGACAGGAAAGAGCAGCTTAGAACAAATTTCTAAAGTCTATGGAAATCAGGCAGTAGTTGTAAGCATTGATCCTCGTAGAGTGTACCTTAAAAGTCCTAATGATGTGCAGTTCAAGACCATAAGGGTCACAAAACCAGGTCCAAGTGGAGAAGAATATGCTTGGTATCAGTGTACGGTTAATGGTGGGCGTGAAGGTCGACCAATTGGGGCTTATGAGCTTGCAAAAGTAGTTGAAGAACTGGGAGCTGGAGAAATACTATTGAACTGCATTGATTGTGATGGTCAAGGAAAAGGATTTGATATAGATTTAATAAAGCTGATATCAGATGCTGTCAGCATCCCTGTAATTGCAAGTAGTGGTGCCGGTGCTGTTGAACACTTCTCGGAGGTATTCATGAAAACAAATGCATCAGCAGCTCTTGCTGCTGGCATTTTCCATCGGAAGGAGGTGCCCATTCAGTCTGTAAAAGAACACTTGTCGAAGGAAGGCATTGAATTTCGGGATTGCTTTCGAAATGATCTGTTCGTATGTAATCATTATATTTGA |
Protein: MEGVPYAYTTSFKTQLFLSSALSSSSIITIHQKRHKTILKSISRRNLVICASSGSSSVVKLLDYGAGNVRSLRNAIHHLGFEIEDVQTPKDILDAERLIFPGVGAFASAMDVLVKTGMADALCSYIKNDRPFLGICLGLQLLFESSEENGPVNGLGLIPGVVGRFNSSNGFRVPHIGWNALQITKDSEILDDIGDHHVYFVHSYRAMPSDDNKEWISSTCNYGDDFIASIRRGNVHAVQFHPEKSGDVGLSVLRRFLDPKSQGTKNLTQGKASKLAKRVIACLDVRTNDKGDLVVTKGDQYDVREHTKENEVRNLGKPVELAGQYYKDGADEVSFLNITGFRDFPLGDLPMLQVLRRTSENVFVPLTVGGGIRDFTDANGRHYSSLEVASEYFRSGADKISIGSDAVHAAEEYMKTKVKTGKSSLEQISKVYGNQAVVVSIDPRRVYLKSPNDVQFKTIRVTKPGPSGEEYAWYQCTVNGGREGRPIGAYELAKVVEELGAGEILLNCIDCDGQGKGFDIDLIKLISDAVSIPVIASSGAGAVEHFSEVFMKTNASAALAAGIFHRKEVPIQSVKEHLSKEGIEFRDCFRNDLFVCNHYI |